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In Google's Broad Wake: Taking Responsibility 
FOR Shaping the Global Digital Library 



hy Richard K. Johnson, Senior Advisor, Association of Research Libraries 



I t wasn't long ago that libraries confronted the challenge of how 
besf fo frame licenses for digifal resources fhaf were assembled 
by ofhers and used by local pafrons. Today fhe challenge is 
fumed upside down as a growing number of research libraries 
enfer info agreemenfs fhaf carry fhe collecfions fhe libraries 
fhemselves have builf and nurfured ouf beyond fheir insfifufions 
and info fhe world. 

Announcemenfs of major projecfs fo digitize library collecfions 
are becoming almosf commonplace: 

• Google has inked deals wifh a growing array of research 
libraries for mass digifizafion of public domain and in- 
copyrighf books in fheir collecfions. 

• Fiffy libraries, museums, fechnology companies, and ofhers 
banded fogefher in fhe Open Confenf Alliance (OCA) fo 
underfake mass digifizafion of library collecfions. 

• Microsoff has goffen info fhe game, bofh as parf of OCA and 
via a plan fo digifize public domain maferial provided by 
library parfners for ifs Live Book Search porfal. 

And lef's nof forgef fhe less-publicized arrangemenfs wifh 
players such as fhose involved in fhe Universify of Michigan's Texf 
Creafion Parfnerships: Early English Books Online wifh ProQuesf, 
Evans Early American Imprinfs Collecfion wifh Newsbank/Readex, 
and Eighfeenfh Cenfury Collecfions Online wifh Gale.l 

Agreemenfs involving insfifufions' born-digifal resources also 
are parf of today's mix. For example, ProQuesf offers a service fhaf 
provides online access fo fheses and disserfafions from parficipafing 
universifies. Similarly, BioMed Cenfral and several ofher businesses 
operate insfifufional reposifories for universifies. 
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The significance of such underfakings — mass digifizafion, in particular — has 
been widely recognized and discussed in fhe public media. Buf wifhin academe, 
elation over fhe possibilities of popular access fo many of fhe world's greaf 
libraries has been fempered by some unease abouf whefher fhe deals go far 
enough in advancing or profecfing insfifufional, scholarly, and public inferesfs. 

Do fhese arrangemenfs adequafely prevenf resfricfions on use of public domain 
works? Do fhey advance fhe opporfunities for resource discovery and 
compufafional analysis? Do fhey anficipafe long-ferm preservation needs? Do 
fhey sufficienfly value fhe subsfanfial invesfmenf made by insfifufions fo develop, 
organize, and mainfain collecfions over time? 

For public and privafe insfifufions alike — all fax-privileged or publicly 
funded insfifufions fhaf sfeward culfural herifage and fhe scienfific record — fhere 
are quesfions of whefher confracf ferms fully serve 
fhe public inferesfs and whefher fhey are 
sufficienfly fransparenf. The practical implications 
of failing fo be open are illusfrafed by fhe 
Smifhsonian Insfifufion's agreemenf giving 
Showfime Networks semi-exclusive righfs fo use 
Smifhsonian resources for developmenf of cable 
programming — a confracf fhaf was never released 
fo fhe public. A confroversy swiffly arose over fhe 
special advanfage given fo a single company, an arrangemenf fhaf seemed 
fo many fo benefif fhe Smifhsonian more fhan fhe public. (A Governmenf 
Accounfabilify Office invesfigafion spurred by fhe confroversy concluded 
fhe agreemenf did nof hamper researchers' access.) 

Given fhe growing base of experience in framing such agreemenfs — and fhe 
prospecf of more agreemenfs fo come — fhis is a good fime fo consider fhe issues 
of practice and principle fhaf surround confracfs specifying how library 
collections can be used by ofher parties and whaf can be done wifh fhe digifal 
files creafed or managed by fhose parfies.2 

Sea Change 

Digifizafion of library collecfions was hardly novel when if bursf info fhe public 
consciousness wifh fhe firsf Google deals wifh libraries in lafe 2004. New, 
however, was fhe scale of invesfmenf Google promised fo bring, fhe sweeping 
scope of fheir vision, fhe rapidify wifh which fhey soughf fo achieve fhis, and 
fheir bold approach fo navigating pofenfial legal obsfacles. 

The effecf of Google Book Search, as fhe program is now dubbed, was 
fo puf library digifizafion efforfs info an enfirely new frame of reference. As a 
Universify of Michigan sfafemenf puf if, "[we] had been digifizing works for 
years, buf af a subsfanfially slower pace fhan whaf was anficipafed wifh fhe 
Google projecf."3 

In more respecfs fhan jusf scale, Google's plan was fundamenfally differenf 
from mosf of fhe digifizafion projecfs underfaken previously by libraries. Google 
broughf more fhan money fo fhe fask; fhey also broughf fhe imperatives of 
business — in parficular, fhe need fo profecf fheir invesfmenf and erecf barriers fo 
compefifive efforfs. Google's business requiremenfs meanf resfricfions on how 
fhe digifal confenf would be used. As one blogger puf if, "is if anofher case of 
'Google can crawl everyone else's dafa, [buf] no one can crawl Google's dafa'?"^ 

Library digitization has focused on preserving and curafing high-qualify 
digifal fransformafions of ouf-of-copyrighf works, making full fexfs openly 



From page 1 

1 These arrangements have their 
roots in the earlier microfilming 
of library collections by a 
multiplicity of companies under 
diverse contracfual arrangemenfs. 
The history of such deals with 
micropublishers may offer lessons 
for the present, but it is important 
to recognize that some of fhe 
fundamental issues at stake in the 
networked digital environment 
are quite different. 



While Google's aim is not at odds with the needs 
or goals of the academy —indeed it promises to 
advance information sharing dramatically— 
Google Book Search isn't a perfect substitute 
for library digitization. 



From page 2 

2 Beyond the scope of this article, 
but of no less significance, are the 
strategic implications for libraries 
of mass digitization. An 
expanded understanding is 
needed of looming questions, such 
as: How does mass digitization 
affect scholarship? To what kinds 
of uses do we envision digitized 
resources will be put? What roles 
will libraries play in this new 
environment? What services can 
libraries develop to support 
emerging and evolving uses? 

Can libraries leverage digitization 
to manage print holdings 
differently? How can resource 
management responsibilities 

and domain expertise be 
brought together? 

3 University of Michigan News 
Service, "Google Library 
Partnership," November 14, 2005, 
http: / / www.umich.edu / news / 
index.html?BG / google / statement. 

4 Gomment on "Google 
'Showtimes' the UG Library 
System," Television Archiving 
Blog, comment posted August 13, 
2006, http: / / www.archival.tv/ 
2006/08/ 13/google-showtimes- 
the-uc-library-system / . 
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available for viewing, searching, crawling, downloading, and printing. 

The Google aim, on the other hand, was simply "to make offline information 
searchable online"^ via Google's search engine. While Google's aim is not at 
odds with the needs or goals of the academy — indeed it promises to advance 
information sharing dramatically — Google Book Search isn't a perfect substitute 
for library digitization. And it does not necessarily anticipate the spectrum of 
opportunities and risks presented by digitization. For example, can anyone else 
build services that use the data — not just for indexing and access but also for 
other forms of computation? What will be the different treatments for works 
that are in copyright, orphaned, or in the public domain? 

Donald Waters of the Mellon Foundation is among those who have voiced 
concern that universities and libraries have been shortsighted about the potential 
uses of digitized materials: 

The discussion about commercially supported digitization projects has 
been primarily focused on keyword search to enhance discovery. The 
argument seems to be that the enhanced value of search and discovery 
is more than worth any negative conditions that might be imposed by 
commercial partners. However, there is much more to the usage 
equation that is of potentially great significance to the academy, and the 
danger is that research institutions may shortchange the academy and 
the public over the long term as they make their bargains with these 
firms. If we keep copyright issues to the side and focus only on public 
domain materials, digitization greatly enlarges the evidence base for 
scholarship. New kinds of analytical techniques are needed to exploit 
this new evidence for the advancement of knowledge, and research 
and teaching methodologies need to be developed to understand the 
evidence and train the next generation of scholars how to use it. 

Scholarly evidence, analytical techniques and disciplinary and 
pedagogical methods, which are so essential to the future of higher 
education in this country and elsewhere, can be greatly advanced — or 
seriously limited — depending on what rights the universities and 
libraries have to use, and let other institutions and scholars use, their 
copies of the digitized materials. ^ 

From the perspective of Google and its competitors, keeping a rein on the 
digitized files enables them to protect their investment. Perhaps even more 
significantly, it ensures they don't lose control of in-copyright materials — an 
eventuality that brings with it the risk of illicit use and the corresponding 
certainty of Google being targeted by copyright owners' lawsuits. The 
prominence of this concern is implied by the relatively looser contractual 
controls over public domain works. 

Despite any qualms, institutions were willing Google partners. Most 
described their motivations in similar terms: 



5 



6 

7 



Google Press Center, "Google 
Checks Out Library Books," 
December 14, 2004, http: / / www. 
google.com / press / pressrel / 
print_library.html. 

Donald Waters, e-mail message to 
author, January 10, 2007. 

Kevin Bergquist, "Google Project 
Promotes Public Good," University 
of Michigan University Record, 
February 13, 2006, http: / / www. 
umich.edu/ ~urecord/0506/Febl3 
_06 / 02.shtml. 



• University of Michigan President Mary Sue Coleman described the 
partnership with Google as offering "three overarching qualities that 
help fulfill our mission: the preservation of books; worldwide access 
to information; and, most importantly, the public good of the diffusion 
of knowledge."^ 

• "'The digitization project furthers [the University of California's] mission,' 
said UC President Robert C. Dynes. 'It greatly expands our ability to give 
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scholars and the public access to the kinds of information and ideas that 
drive scholarly innovation and public knowledge and discourse/"^ 

• "The Bodleian Library's mission, from ifs founding in 1602, has been 
based on Sir Thomas Bodley's vision of a library serving fhe worldwide 
'Republic of Leffers', wifh fhe Library's collecfions open fo all who have 
need fo use fhem," explained Ronald Milne, Acting Director of Oxford 
Universify Library. "The Google Library Projecf in Oxford testifies fo 
our ongoing commifmenf fo enable and facilifafe access fo our confenf 
for fhe scholarly communify and beyond."^ 

Showing a hinf of scholarly skepficism. Harvard's fhen-presidenf, 
Lawrence H. Summers, hedged a bif: "If fhis experimenf is successful, 
we have fhe potential fo provide fhe world's greafesf system for 
disseminafion."lO In any evenf. Harvard and ofhers joined wifh Google 
knowing fhaf fheir arrangemenfs were nof exclusive, fhus protecting fheir 
fundamenfal inferesf. 

Beyond Google 

Google's 2004 announcemenf of five agreemenfs was as dose fo a preemptive 
sfrike as could be imagined, buf if was sfill only fhe beginning. Ofhers have 
since joined fhe fray, libraries and businesses alike. 

Aparf from Google's confroversial approach fo 
in-copyrighf maferials,ll fhe deals being sfruck 
have been largely framed by Google's firsf -mover 
advanfage. As one library direcfor reporfed 
privately, Microsoff inifially proposed ferms fhaf 
were 180 degrees differenf from Google's, buf as 
fhey've goffen down fo brass facks, fhe agreemenf 
has sfarfed fo look a lof like Google's. 

Ulfimafely, fhough, more experience fogefher wifh compefifion for use of 
insfifufions' resources could mean beffer ferms for fhe insfifufions. Libraries 
would do well fo rejecf fhe notion fhaf fhe confracfs execufed fo dafe have 
esfablished fhe norms for fufure agreemenfs. 

The Open Confenf Alliance, esfablished in November 2005, pointedly 
embraced principles of open accessibilify. Organizafions fhaf are confribufing 
fo fhe OCA have indicafed fheir supporf of several broad principles: 

1. The OCA will encourage fhe greafesf possible degree of access fo and 
reuse of collecfions in fhe archive, while respecting fhe righfs of confenf 
owners and confribufors. 



8 University of California Office of 
the President, "UC Libraries 
Partner with Google to Digitize 
Books," August 9, 2006, http: / / 
www.universityofcalifornia.edu / 
news / 2006 / aug09.html. 

9 Google Books Search Library 
Partners, http: / /books. 
google.com/googlebooks/ 
partners.html. 

10 "Harvard Libraries and Google 
Announce Pilot Digitization 
Project with Potential Benefits to 
Scholars Worldwide," Harvard 
University Gazette, Dec. 14, 2004, 
http:/ / www.hno.harvard.edu/ 
gazette / daily / 2004 / 12 / 
13-google.html. 

11 For more on this rich topic, see: 
Jonathan Band, "The Google Print 
Library Project: A Copyright 
Analysis," ARL: A Bimonthly 
Report, no. 242 (Oct. 2005): 6-9, 
http:/ /www.arl.org/bm~doc/ 
arlbr242google.pdf. 



Libraries would do well to reject the notion that 
the contracts executed to date have established 
the norms for future agreements. 




2. Confribufors will defermine fhe ferms and condifions under which 
fheir collecfions are disfribufed and how attribution should be made. 

3. The OCA need nof be obligafed fo accepf all confenf fhaf is offered fo if 
and may give preference fo fhaf which can be made widely accessible. 

4. The OCA will offer collection and ifem-Ievel mefadafa of ifs hosfed 
collections in a variefy of tormafs. 

5. The OCA welcomes efforfs fo creafe and offer fools (including finding 
aids, cafalogs, and indexes) fhaf will enhance fhe usabilify of fhe 
maferials in fhe archive. 
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6. Copies of the OCA collections will reside in multiple archives 
internationally to ensure their long-term preservation and 
accessibility to all. 12 

According to OCA's Brewster Kahle: 



12 Open Content Alliance, 

"A Call to Participate in the 
Open Content Alliance," http: / / 
www.opencontentalliance.org / 
participate.html. 



In essence, we want to get the rules right, to enable libraries to 
work with commercial sources, governments, etc., without having 
to hammer out separate agreements. Like the Open Source movement 
has done for software, we want the Open Content movement to do 
for institutions, to let them play a role. We want to define a way 
through the puzzle of who does what, to establish mechanisms 
for cooperation.l3 

The devil, of course, is in the details. The OCA Web site indicates that, 
"Generally, textual material will be free to read, and in most cases, available 
for saving or printing using formats such as PDF. Contributors to the OCA 
will determine the appropriate level of access to their content." However, it 
goes on to say, "OCA welcomes all efforts to create and offer tools (including 
finding aids, catalogs, and indexes) that will enhance the usability of the 
materials in the archive. 

OCA sidestepped much of the legal risk taken on by Google by targeting 
only material that is either in the public domain or has the copyright holders' 
authorization. But, being funded by its contributors and donations, OCA lacks 
the deep pockets of Google, which is amassing its trove of scanned books at far 
greater speed. 

Microsoft is one of the OCA participants and committed to fund the 
scanning of 150,000 public domain books. They also are making deals on 
their own, including large-scale digitization of books from the British Library, 
University of California, Cornell University, New York Public Library, 
University of Toronto, and the American Museum of Veterinary Medicine. 
According to the October 2006 Cornell announcement: 



13 Brewster Kahle, quoted in Barbara 
Quint, "Open Content Alliance 
Rises to the Challenge of Google 
Print," Information Today, October 
3, 2005, http: / / www.infotoday. 
com/ newsbreaks/nb051003-2. 
shtml. 

14 Open Content Alliance, "FAQ," 
http: / / www.opencontentalliance. 
org / faq.html. 

15 Gary Price, comment on 
"Microsoft Announces MSN 
Book Search; Joins Open Content 
Alliance," SearchEngineWatch 
Blog, comment posted October 26, 
2005, http:/ /blog.searchengine 
watch.com/blog/ 051026-000127. 

16 Cornell University Press Relations 
Office, "Cornell University 
Library Announces Partnership 
with Microsoft That Will Allow 
Global Access to Its World-Class 
Resources," October 17, 2006, 
http:/ / www.news.cornell.edu/ 
pressofficel / Oct06/library_ 
microsoft.shtml. 

17 Brewster Kahle, comment on 
"Announcing the Open Content 
Alliance," Yahoo! Search Blog, 
comment posted on October 2, 2005, 
http: / / www.ysearchblog.com/ 
archives / 2005_10.html. 



Microsoft will give the Library high-quality digital images of all the 
materials, allowing the Library to provide worldwide access through 
its own digital library and to share the content with non-commercial 
academic initiatives and non-profit organizations. ^6 



But whether third parties will be able to develop services using these 
files is not stated and none of the Microsoft contracts have been made public. 

The Microsoft digitization initiatives also are skirting thorny legal tie-ups 
by focusing on out-of-copyright scholarly material plus in-copyright books 
that are sent to Microsoft by publishers or authors for scarming by the company 
free of charge. 

Yahoo is part of OCA, too, and will index the OCA content. It reportedly is 
"funding the digitization of an initial corpus of American literature collection 
that the University of California system is selecting."l7 



What's Ahead? 

Given the broadening scope of book digitization projects now underway, it is 
reasonable to believe we will see more action in the months and years ahead to 
harness and exploit other kinds of knowledge assets. These might include: 

• unique and unpublished materials in special collections of libraries 
and archives; 
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• course materials; 

• theses, dissertations, and other student projects; 

• images, video, sound, and animations; 

• data sets; 

• works contained in institutional repositories; 

• government information; and 

• technical reports and other gray literature. 




Collaborations already reach beyond research universities. The New York 
Public Library was among the first wave of Google parfners and also is working 
wifh Microsoff. The Library of Congress is now conducfing a pilof projecf wifh 
Google. NASA is working wifh Google fo give fhe public easy access fo ifs mass 
of images and dafa. The Biodiversify Herifage Library, a cooperative of 
bofanical gardens and museums, is parf of OCA. As fhe commercial sector 
digs deeper, if's a safe bef fhaf affenfion will extend fo ofher culfural herifage 
organizafions and scienfific research cenfers. And as digifization eftorfs 
torn increasingly fo unique maferials, insfitofional negofiafing leverage 
may be greater. 

Given fhe prospecf of more deal-making, fhere is some popular senfimenf 
among research libraries fhaf fhe communify would be well served by a sef of 
principles fo be upheld in — or af leasf provide a fouchsfone for — digifization or 
similar agreemenfs. Such an asserfion would increase fhe under sfanding and 
negofiafing prowess of libraries and give weighf fo fheir posifions. 

Getting It Right 

Whefher or nof fhe provisions of Google's agreemenfs wifh libraries fo date 
have casf fhe mold for ofher negofiafions, fhey cerfainly are parf of fhe markef 
confexf. Forfunafely, negofiafions are already being guided by fhe sfrong values 
of fhe library culfure and fhe imporfance of information sharing fo fhe 
funcfioning of fhe academy. This influence is evidenf in fhe foundation of fhe 
Open Confenf Alliance — even if, in fhe OCA's eftorf fo unite a diverse group of 
academic and commercial collaborators, fhey have made some compromises. 

Buf fhe need for more fhan good infenfions was acknowledged af an 
April 2006 Digitization Policy Workshop organized by fhe American Library 
Association's Office of Information Technology Policy (OITP) Advisory 
Committee and attended by represenfafives from fhe library and culfural 
herifage communities. A sfafemenf fhaf emerged from fhe workshop observed: 

The Infernef has broughf fhe world of information fo classrooms, 
homes and offices of people worldwide. For over a decade now, 
libraries and ofher culfural herifage insfifufions have been opening 
fheir rare and fragile collections fo scholars and grade school children 
alike by digitizing fhese collecfions and making fhem available via fhe 
infernef. In fhe lasf year, libraries have been moving from smaller 
digitization projecfs fo mass digifization projecfs fhaf will eventoally 
make available whole collections, including millions of books. 

Funding agencies are supporfing research and demonsfrafion 
projecfs fhaf aid libraries and culfural herifage insfifufions in better 
undersfanding digitization processes, web harvesfing, fool 
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development, and assessment and evaluation. All of this has taken 
place without a coherent body of policy fo guide decision-making. 

In facf, over fhe pasf two years fhere have been a number of efforfs — formal 
and ofherwise — fo arficulafe core library inferesfs in digifizafion parfnerships. 
Following are some examples fhaf may help inform fhe agreemenfs fhaf have yef 
fo be negofiafed. 

American Library Association 

In the wake of ALA's Digifizafion Policy Workshop, fhe association's Office for 
Information Technology Policy consfifufed a Task Force on Digifizafion Policy, 
which draffed fhese "Principles for Digitized Confenf." The principles are fo be 
presenfed fo fhe ALA Council for discussion and pofenfial adoption in 2007: 



18 American Library Association, 
Task Force on Digitization Policy, 
"Background," http: / / www. 
ala.org/ ala/washoff / contactwo/ 
oitp / digtask.htm. 

19 American Library Association, 
Task Force on Digitization Policy, 
"Principles for Digitized Content 
(Draft)," http: / / www.ala.org/ 
ala / washof f / contactwo / oitp / 
digtask.htm# prin. 



1. Digifal libraries ARE libraries. The policies of fhe Associafion apply fully 
fo digifal libraries including fhe core values such as commifmenf fo 
access, confidenfialify/ privacy, fhe public good, and professionalism. 



2. Digifal confenf, like ofher library materials, musf be given fhe same 
consideration for collecfion developmenf, ease of access, freedom of 
information, and preservafion. 



3. Digifal activities and fhe resulfing collections musf be susfainable by 
libraries. Sustainability requires secure and ongoing funding, fechnology 
solufions fhaf are appropriafe fo fhe longevify of fhe culfural record, and 
long-ferm managemenf capabilifies. 



4. Digifizafion on a large scale requires collaboration. Collaboration enables 
the building of collections fhaf supporf research, scholarship and 
information needs of diverse communities. Collaboration will require 
sfrong organizafional supporf and promofion by culfural herifage 
professionals, fheir insfifufions, and fheir associations. 



5. Digifal acfivify requires ongoing communication for ifs success. The 
library and culfural herifage communify musf reach ouf fo fhe public, 
fo governmenf, and fo funding insfifufions wifh a clear and compelling 
message regarding fhe role of digifal libraries and collecfions. 



6. Digifal collecfions increasingly address an international audience. These 
collecfions are parf of a global information infrasfrucfure fhaf is nof 
limifed by geography. 



7. Digifal collecfions are developed and susfained by an educafed 

workforce. Members of fhe culfural herifage professions musf engage in 
continuous learning and be able fo explore new fechnology, fo work wifh 
new parfners, and fo reach new audiences. 



8. Digifal maferials musf be fhe objecf of appropriafe preservation. 

Preservation acfivities require fhe developmenf of sfandards and besf 
practices as well as models for susfainable funding fo guaranfee long 
ferm commifmenf fo fhese maferials. 



9. Digifal collecfions and fheir maferials musf adhere fo standards fo 
maximize fheir usefulness. Sfandards musf serve fhe broadesf 
communify of users, supporf susfainable access and use over time, and 
provide user funcfionalify fhaf promofes fhe core library values. 
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The Andrew W. Mellon Foundation 

In spring 2005, Donald Waters, Mellon Foundation Program Officer for 
Scholarly Communications, inifiafed an informal discussion wifh Daniel 
Greensfein of fhe California Digifal Library and ofhers on fhe need fo frame a sef 
of principles fhaf would inform fhe negofiafion of mass digifizafion agreemenfs. 
A draff fexf, never promulgafed, suggesfed fhaf libraries bear "a special 
responsibilify wifh regard fo fhe fufure use and disposition of fhe collections 
fhey have assembled and managed in large parf fhrough public invesfmenf, 
profecfion, and ofher special affordances." The draff fexf argued: 



"Upholding the Public Goods of 
Massively Digitized Library 
Holdings," unpublished 
manuscript obtained by the 
author from Daniel Greenstein 
on October 3, 2006. 



[M]assively digifized collections builf in whole or in parf on library 
holdings will uphold and supporf fhe public goods. . .where fhey can be 
confribufed fo massively digifized collecfions fhaf are: 

• persistent as whole collections whose long-ferm managemenf is 
governed by responsible fhird parfy archives empowered under 
specified business conditions fo make collecfion confenf generally 
and widely available, for example, when fhey are no longer accessible 
via ofher means 



• configurable for example fhrough fhe use of published and open 
sfandards fhaf supporf and enable fhird-parfy service developmenf 
and definifion 



• respectful of fhe righfs and prerogafives of copyrighf holders 

• attributable in a sfandard way fhaf makes if possible readily and 
fransparenfly fo identify fhe provenance of any ifem as well as fhe 
ferms and conditions fhaf govern ifs disfribufion and use 

• openly and freely accessible af leasf where collecfion and ifem-level 
mefadafa are concerned 



• created through the use of non-exclusive agreements fhaf preserve for 
confenf providing libraries (perhaps affer a fime-limifed period nof 
exceeding six years) fhe non-exclusive worldwide and perpefual 
righf fo manage and disfribufe digifal files fhaf are creafed from 
fheir holdings.^O 



Francophone National Libraries 

A high-level view emerged from a February 2006 meeting of six francophone 
national libraries, which endorsed fhese principles: 

• Agreemenfs wifh privafe sector parfners fo publish or digifize 
significanf collections will be non-exclusive in nafure, in bofh a legal 
and a de facfo sense. 



• Public access online fo publicly owned resources will remain free. 
Information providers may develop and charge for value-added 
feafures buf fhe source maferial should be accessible and free. 



i*i*i*i 



31 



• The digifal images will be prepared fo a suifable preservafion sfandard 
and mainfained in fhe public sector wifh a commifmenf fo long-ferm 
preservafion and accessibilify. 

• As far as possible, online access will be mulfi-lingual and mulfi-culfural 
as appropriafe for fhe source maferial. 
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• The integrity and authenticity of the original source material will be 
maintained and cannot be altered in the online environment .21 



Ingrid Parent, August 10, 2006, 
e-mail message obtained by 
the author. 



Committee on Institutional Cooperation 

The perspective of a consortium of large libraries emerged from fhe Committee 
on Insfifufional Cooperation (CIC) library direcfors, who endorsed a sef of 
principles in spring 2006. These principles relafe fo federating mass 
digifizafion efforfs among fhe CIC libraries. Several perfinenf principles are 
excerpfed here: 

• Digifizafion goals are for wide disfribufion, improved access mefhods 
(including fhe use of robusf mefadafa), preservafion of maferials for fhe 
fufure, abilify fo use confenf in new ways fhaf facilifafe new forms of 
research, abilify fo leverage fhe invesfmenfs of all CIC universities, and 
avoidance of unnecessary duplication of efforf. 



22 "Federating Mass Digital Efforts 
in the CIC Libraries, Endorsed by 
the CIC University Librarians, 
Spring 2006 (V.1.2)," unpublished 
manuscript obtained by the 
author. 

23 James Michalko, "Mass 
Digitization Partnerships — Rights 
and Responsibilities of Research 
Institutions," unpublished 
manuscript obtained by the 
author on October 22, 2006. 



• Recognizing fhaf sfandards are dynamic and evolving, fhe efforfs for 
digifizafion, access, mefadafa, storage, and preservafion will be based on 
generally accepfed and widely adopfed infemafional sfandards. 



• Resources combined fhrough fhe federafion will be "open access" 
fhroughouf fhe CIC fo fhe exfenf legally permissible, buf if is undersfood 
fhaf invesfmenfs will be necessary for fhe developmenf of fhe federafed 
resources fhaf mighf resulf in charges fo users beyond fhe CIC. 



• CIC libraries fhaf sign agreemenfs wifh vendors in order for fhose 
vendors fo digitize maferial from library collections agree fhaf such 
digifizafion will be done wifh fhe infenf of encouraging open access and 
discouraging re-purposing of fhe confenf for sale by fhe vendor.22 



OCLC's RLG Programs 

In a draff fexf shared in November 2006 wifh fhe ART Scholarly 
Communications Sfeering Committee, James Michalko of OCLC's RLG 
Programs division suggested several potential expecfafions of digifizafion 
parfnerships — fhese, foo, anchored in fhe notion of library collecfions as a 
public good, buf recognizing fhaf "expecfafions will be defined by sifuafional 
needs." Following is an excerpf: 



Digifized works supporf fhe public good and scholarly use when fhey 
exhibif affribufes fhaf enable fhe creafive use and re-use of collecfions 
builf and managed as parf of a common culfural herifage. Research 
insfifufions have a shared inferesf in securing fhe righfs necessary fo 
ensure fhaf fhe producfs of mass digifizafion supporf or enable: 

• recombination 



• deeper access 

• more immediafe access 



• more malleable fexf 

• derivafive formafs 

• links fo related research, dafa, and primary sources 

• aufhenficify23 




^ 9 
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American Council of Learned Societies 

In their 2006 report. Our Cultural Commonwealth, the American Council of 
Leaned Societies Commission on Cyberinfrastructure identified five "necessary 
characteristics of a robust cyberinfrastructure in the humanities and social 
sciences." Institutions' agreements to make their collections available on the 
Internet might usefully be evaluated against these five "necessary characteristics 
of a robust cyberinfrastructure": 

• It will be accessible as a public good. 



24 Mario Welshons, ed.. Our Cultural 
Commonwealth: The Report of the 
American Council of Learned 
Societies Commission on 
Cyberinfrastructure for the 
Humanities and Social Sciences 
(New York: American Council 
of Learned Societies, 2006), 
http: / / www.acls.org/ cyber 
infrastructure / OurCultural 
Commonwealth.pdf. 



• It will be sustainable. 

• It will provide interoperability. 

• It will facilitate collaboration. 



25 "CUL Large-Scale Digitization 
Initiative," unpublished 
manuscript dated September 2006, 
obtained by the author and 
quoted with permission. 



• It will support experimentation.24 



Cornell University 

Principles and objectives are not the same, but a look at Cornell University 
Library's objectives in entering into a collaboration with Microsoft — extracted 
from an internal document entitled, "Why Are We Collaborating with 
Microsoft?" — illuminates both the motivations of the institution's leaders and the 
principles that suffuse their culture [headings added]: 

• Advance global scholarship 

This partnership supports the Library's commitment to make its collections 
available to a wide audience in a high-quality digital format that meets 
users' sophisticated access and discovery requirements. We have a long- 
standing commitment to make our collections broadly available to support 
global scholarship. 

• Support the work of the institution 

By partnering with Microsoft, we participate in content selection decisions 
to ensure that not only popular but also scholarly materials that support the 
CUL's academic programs are available in digital form on the Web. 

• Enhance preservation 

By receiving all the digital scans back, we are not relying on a commercial 
entity to assume the preservation responsibility for either print or digital 
versions of this important scholarly content. 

• Expand scale of online resources 

During the first year of the partnership, up to 170,000 monographs and 
serials will be digitized and be accessible on the web. . .. Cornell has been 
working for fifteen years to digitize significant collections, and this 
effort... enhances CUL's leadership role in transforming our collections. 

• Improve return on investment 

Some of these materials are currently getting very low use. We believe that 
digitizing and making them available online will boost their use and will 
increase the long-term returns on our investment in the purchase, storage, 
and maintenance of Cornell's extensive print collections. 

• Ensure flexibility 

The agreement with Microsoft is non-exclusive and the Library is free 
to join other partnerships with the same or different content in digitizing 
its collection.25 
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Advancing Global Interests 

It is fair to assume that immediate local interests will mostly take care of 
themselves as institutions work out terms of partnerships that utilize their 
library collections. But, as the articulations above demonstrate, strategic, 
community-wide, and societal interests — ultimately ensuring a full flowering of 
the emerging global digital library — must be an explicit part of the negotiation 
and a legitimate object of applying available leverage. This obligation is of 
significance for public and private institutions alike. 

The US National Archives and Records Administration (NARA) recently 
learned the hard way the danger of failing to adopt a set of principles to guide 
their negotiation of a digitization agreement with Google. Reports of the terms 
of an imminent agreement were greeted by much of the library community with 
"concern about the ownership — physical and legal — of both the underlying 
documents and the subsequent digital files."26 As a consequence, NARA went 
back to the drawing board to develop a set of principles framing development of 
such NARA agreements. 27 

To aid institutions in conceptualizing the principles that may guide them, 
following is a simple clustering of some of the critical interests expressed above, 
colored by perspectives gleaned from a sampling of interviews during fall 2006 
with directors of six North American research libraries plus officials of three 
academic library organizations. This list is expressed not as principles but as 
broad issues and strategic interests against which potential agreement terms 



26 Association ot Research Libraries, 
American Association of Law 
Libraries, American Library 
Association, Special Libraries 
Association; letter to NARA; July 
6, 2006; http: / / www.arl.org/ 
bm~doc/ naragoogleletter.pdf. 

27 NARA subsequently announced a 
non-exclusive agreement with 
Footnote, Inc., to digitize selected 
records from the Archives' 
holdings. See: http: / / www. 
archives.gov / press / press- 
releases / 2007 / nr07-41 .html. 



can be evaluated. 



Exclusivity: As a practical matter, exclusive agreements risk ceding crucial 
rights over use and reuse of digital files made from the collections an 
institution has assembled and supported at great expense. Future 
opportunities in a fast-changing environment 
are hard to anticipate, so any restriction of 
rights has both local and societal 
implications. From a local perspective, 
exclusive deals may inhibit the ability to 
obtain enhanced value from that investment 
in years to come. From a broader 
perspective, exclusive deals stifle creative uses of collections. Of course, 
exclusivity comes in many sizes and shapes, potentially defined by time, 
copyright status, geography, media /formats, markets, or other factors. 
However, institutions and society benefit most when exclusivity in any 
form is minimized. 



...strategic, community -wide, and societal 
interests... must be an explicit part of the 
negotiation and a legitimate object of applying 
available leverage. 



• Uses of the digital files: The issues surrounding uses of the digitized works 
lie at the core of an institution's objectives in entering an agreement, 
which probably include expanding collection usage and supporting 
learning and new contributions to scholarship. As a practical matter, 
many of the uses envisaged in an agreement will need to be framed in 
light of the copyright status of works. While the rights to use public 
domain materials are wide open under statutory law — and should remain 
so in the environment framed by a contract — use rights with respect to 
in-copyright materials are less clear-cut and indeed may be interpreted by 
pending or future litigation. Within each of these two classes of work, the 
kinds of uses to be anticipated by a contract will likely touch not just 
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upon what can be done with the partner's product offering, buf also 
whefher all or parf of fhe digitized works can be rehosfed by fhe source 



28 a useful exploration of 

computation access to scholarly 



insfifufion or ofhers and whefher services fhaf inferacf wifh fhe dafabase 
can be implemenfed by fhe insfifufion or by fhird parties. While fhese 
services mighf very well include keyword searching and display of a 
resulf of specified scope (e.g., a "snippef" or somefhing more), a 
forward-looking agreemenf will enable emerging compufafional 
applicafions such as fexf mining.28 

Respect for the public domain and copyright: Recognizing fhe righfs of 
copyrighf holders is bofh a principled and a pracfical obligation, requiring 
fhaf arrangemenfs accounf for how information abouf ownership and use 
righfs will be conveyed. There is also a danger here fhaf, wifhouf explicif 
profecfions, resfricfive confracf provisions could supplanf use profecfions 
granfed under copyrighf law fhaf are invaluable fo scholarship. 

Academic insfifufions have an inferesf in preserving sfafufory fair use 
and educational exemptions and fhe sfafus of public domain works. 

As a maffer of principle and self-inferesf, agreemenfs fhaf infroduce 
resfricfions beyond fhose fhaf apply fo prinfed works should be resisfed. 
In a parfnership, fhere is also a danger fhaf fhere will be disagreemenfs 
abouf how fhe law will be inferprefed. In fheir agreemenf wifh Google, 
fhe Universify of Wisconsin profecfed againsf such an evenfualify by 
securing digifal copies of public domain works scarmed from fheir 
collection and fhe righfs fo use fhese. They plan fo make fhe works 
available fo fhe public under ferms of fhe universify's choosing. 



literature is found in: Clittord A. 
Lynch, "Open Computation: 
Beyond Human-Reader-Centric 
Views ot Scholarly Literatures," 
in Open Access: Key Strategic, 
Technical, and Economic Aspects, 
ed. Neil Jacobs (Oxford: Chandos 
Publishing, 2006), 185-193, 
http: / / www.cni.org / statt / 
clittpubs / OpenComputation.htm. 



• Preservation: All digitization projects have implications for preservation of 
works in the collection. An objective of digitization may or may not be to 
create a digital preservation copy, which has a bearing on maintenance of 
the corresponding print collection. Indeed, in some cases the act of 
digitizing printed material endangers or destroys the printed copy. The 
issues go beyond standards to questions of selection, collection integrity, 
quality control, handling, loss or damage, accountability, return of 
artifacts, and potential multi-institution initiatives to prune collections of 
duplicate items. Regardless of whether digitization creates a preservation 
copy, any agreement should address the need for persistent access to the 
digitized works. A strategy is needed for redundant storage of files in 
geographically dispersed locations and there are also questions of who 
will migrate content and what will be needed over the long term to enable 
this — technically, financially, and organizationally. 




• Use of standards: An institution's goals, especially with respect to use 
and preservation of works, are advanced when appropriate standards 
are employed by the entity that wishes to exploit the collection. The 
benefits of standards use may in some cases be greater for libraries than 
for commercial partners. Their use has implications for up-front costs 
but also maintenance costs and the need for refreshing of technology. 
Failure to apply existing standards can create intended or unintended 
exclusivity issues. 

• Quid pro quo: Assuring adequate valuation of the library's contribution to 
a partnership is akin to a fiduciary responsibility to the institution that 
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supports the collection. Measures of value might include royalties, cost 
savings, or acknowledgment of fhe library in producfs or services. Buf a key 
measure is fhe exfenf fo which an agreemenf advances fhe communify and 
sociefal benefifs of fasfer, less burdensome, more open information sharing. 

• Transparency: Because of fheir role in advancing and preserving 
knowledge, research libraries and fheir insfifufions (and mosf ofher 
culfural herifage insfifufions) enjoy enormous public frusf. Indeed, in fhe 
US, fhey are profecfed by sfafufory provisions such as fhe library privileges 
in copyrighf law and, in mosf cases, fhey benefif from being nof-for-profif 
organizafions under fhe fax code. If follows fhen fhaf fhey are accounfable 
fo fhe public for fhe ferms of fhe agreemenfs and fhe exfenf fo which fhe 
resulfs of a deal will serve fhe public good. Their sfafus brings wifh if a 
principled obligation fo be as open and fransparenf as possible in fheir 
business dealings. Even as a purely practical maffer, fhe public 
communicafion componenf of a deal needs fo be faken info accounf. 

As Smifhsonian Insfifufion officials acknowledged in fhe wake of fhe 
public oufcry over fheir confidenfial confracf wifh Showtime, "[we] may 
nof have done enough inifially fo inform ofhers abouf how fhe confracf 
would affecf existing Smifhsonian policies and procedures. "29 



29 Jacqueline Trescott, "Smithsonian 
Deal with Showtime Passes 
Muster," Washington Post, 
December 16, 2006, Cl, http: / / 
www.washingtonpost.com / 
wp-dyn/ content / article / 2006 / 
12/15 / AR2006121501894_pf.html. 

30 Kevin Kelley, "Scan This Book!" 
New York Times, May 14, 2006, 
http: / / www.nytimes.com/ 2006/ 
05 / 14 / magazine / 14publishing. 
html?ex=1305259200&en=c07443d 
368771bb8&ei=5090. 

31 Ross Atkinson, "Six Key 
Challenges for the Future of 
Collection Development," Library 
Resources & Technical Services 50, 
no. 4 (October 2006): 245. It is 
worth noting that Atkinson 
concluded individual libraries will 
never lose their competitive 
differentiation because "[e]ach has 
vastly different resources — not 
only financial, but also human 
and creative resources, including 
different visions and values." 



Good Stewardship 

The groundwork for fhe global digifal library — ^whaf's been described as "a single 
liquid fabric of inferconnecfed words and ideas''^^ — is faking shape in fhe broad 
wake of Google's ambitious book digitization program. In offering up fheir 



However, he cautioned that 
libraries' collection services 
would likely face new 
competitive forces. 



collections, research universities and ofher insfifufions have eifher overcome 



or guarded againsf whaf Ross Afkinson described 
as fheir "morbid fear. . .of becoming mirror images 
of each ofher as we move toward a more unified 
collecfion."3l 

Buf like mosf changes of such scale and 
complexify, fhis virfual library will grow in iterative 
fashion, faking accounf of fhe inferesfs of differing 
players, fhe attributes of differing kinds of works, fhe imperafives of differing 
legal circumsfances, fhe vigor of emerging fechnologies and sfandards, and an 
evolving comprehension of whaf is possible and necessary. 

Obviously, local circumsfances and objectives will play a prominenf role in 
mosf negofiafions fo make local resources available on fhe nefwork. Inevifably, 
fhe possible will be balanced againsf fhe pracfical. Buf fhe inferesfs of each 
individual insfifufion are besf served when all are negotiating vigorously for 
agreemenfs fhaf maximize fhe full pofenfial of nefworked digifal libraries. If fhe 
ulfimafe mission of fhe academy is fo advance knowledge, promofe feaching and 
research, and address fhe challenges fhaf face our sociefy, fhen resources 
sfewarded by libraries should be shared as widely as possible. 

As additional resources are fargefed for dissemination on fhe Infernef, 
libraries and fheir insfifufions should negofiafe sfrafegically. They should make 
full use of whafever leverage fhey have fo ensure fhe global digifal library is 
open and dynamic. 



...the interests of each individual institution are 
best served when all are negotiating vigorously 
for agreements that maximize the full 
potential of networked digital libraries. 
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Negotiation Checklist 



For a useful look at relevant 



T he following are key issues to weigh as your institution considers entering 
an agreement involving digitization or other business exploitation of its 
collections. For many questions below, the answer will differ for public 
domain vs. in-copyright material. These points only begin to outline the multiple 
layers of questions that institutions should ask before they enter significant 
agreements. Moreover, there are many additional practical issues to plan for — 
such as warranties and indemnifications, term and termination, cost 
apportionment, scheduling, and workflows.* 



models and elements ot business 
relationships between educational 
institutions and the commercial 
sector, see Peter B. Kaufman, 
"Marketing Culture in the Digital 
Age: A Report on New Business 
Collaborations between Libraries, 
Museums, Archives, and 
Commercial Companies", 

August 25, 2005, http: / / www. 
intelligenttele vision, com / 
MarketingCultureinDigitalAge.pdf. 



Exclusivity 

D Are you able to enter similar agreements with other parties? 

D Are there any restrictions on this right? 

D Where exclusivity may be justified, what is the maximum 
period over which it applies? 



D Are exceptions needed? Are there other ways exclusivity can be limited? 



Uses of the Digital Files 

n Does the agreement distinguish between uses of digitized public 
domain content vs. in-copyright content? 

n Will your institution be provided a copy of any transformation of 

works from your collection (along with associated metadata and image 
coordinates)? In what form will this be provided? 

□ What rights does the institution have to use the digital files and 
with whom may they be shared — members of your institutional 
community? Your consortium? Any user of the Web? 

□ What provisions are there for the library to manage the files (e.g., take 
down content when warranted, reformat, add enhancements)? 

□ Under what terms will end users have access to content? 

For how long are these terms guaranteed? 

□ Under what circumstances will users have access to different 
views of works (e.g., "snippets," "limited preview," 

full page views, downloadable PDFs, etc.)? 

□ Will works be attributable and employ durable identifiers? 

□ Will works be open to computational analysis (e.g., text mining)? 
Under what circumstances? 

□ Is there a right for third parties to create services (such as discovery 
or presentation tools) without unreasonable limitation? 

□ What uses are prohibited? 

□ Will restrictions lapse after a given period of time? 

n Are there provisions for any change in the library's rights if the other 
party becomes insolvent or no longer makes the files available? 



Respect for the Public Domain and Copyright 

□ Are copyright restrictions being inappropriately extended to 
public domain works? 
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□ who decides whether a work is in the public domain or not? 

□ Will treatment of currently in-copyright works change 
when they enter the public domain in the future? 

D Are fair use and educational profecfions of fhe copyrighf 
law preserved? 

Preservation 

n Are fhe formafs being used appropriafe for confenf migrafion over time? 

n Will characferisfics of fhe original publication 
(such as color or complex graphics) be replicafed? 

□ Will original ifems be refurned fo fhe collection? 

□ Will desfrucfive processes be used in digifizafion? 

How will fragile maferials be handled? 

□ Who will selecf ifems fo be included? 

□ Whaf arrangemenfs are fhere for qualify confrol? 

□ Whaf sfeps will be faken fo ensure fhe infegrify and ufilify of 
digifal files? How will fhis be funded? Who is responsible? 

□ Will copies of digifal files be kepf in mulfiple 
geographically dispersed locafions? 

D Will fhe works persisf as an identifiable collection? 



Standards 

D whaf sfandards will be adhered fo wifh respecf fo 
image qualify, mefadafa, and file formafs? 

n Will sfandards employed allow fhird parties fo develop services? 

n Will specified sfandards be non-propriefary and sufficienfly 
flexible fo accommodafe fhe changing environmenf? 

Quid pro Quo 

□ Is fhe library /insfifufion acknowledged in all uses granfed 
under fhe agreemenf? Whaf forms will fhis fake? 

□ Will fhe library /insfifufion receive royalties or ofher paymenfs? 
(Alfhough if is nof a confracfual issue, consider whefher 

fhis will be credifed fo fhe library's budgef.) 

D Are research, feaching, and learning advanced 
sufficienfly by fhe agreemenf? 

Transparency 

□ Have you refained fhe righf fo make a copy of fhe agreemenf 
available fo fhe public? 

□ How will you communicafe fo your insfifufional communify and fo fhe 
public abouf your goals in enfering fhe agreemenf, how fhe goals will 
be effecfed, and fhe sfeps you have faken fo advance fhe inferesfs of 
your communify and fhe public? 



Article and Checklist are copyright 
© 2007 Richard K. Johnson and are 
distributed under the terms of the 
Creative Commons Attribution- 
NonCommercial 2.5 license (http: / / 
creativecommons.org / licenses / 
bync/2.5/.) 
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ARL Calendar 2007 

http: / / www.arl.org / events / calendar / 



February 8-9 


ARL Board Meeting 


July 23-24 ARL Board Meeting 




Washington, DC 


Washington, DC 


March 12-16 


Service Quality Evaluation 


September 16-18 ARL Academy Leadership 




Academy 


Institute 




New Orleans, Louisiana 


Washington, DC 


April 16-17 


CNI Spring Task Force Meeting 


October 9-12 ARL Board & Membership 




Phoenix, Arizona 


Meeting 

Washington, DC 


April 21-26 


Research Libraries Leadership 
Fellows Institute I 


ARL Membership 
Meetings 2008 




Raleigh-Durham, North Carolina 


May 22-25 


ARL Board & Membership 






Meeting 

St. Louis, Missouri 


May 21-23, 2008, Coral Gables, Florida 
October 15-17, 2008, Washington, DC 


July 18-20 


Institute on Scholarly 
Communication 
Washington, DC 


Tentative Dates 


July 21-26 


Research Libraries Leadership 
Fellows Institute II 
Cambridge, Massachusetts 
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